Optimized Privilege Evaluation #4380

nibix · 2024-05-30T09:58:15Z

Description

This implements the optimized privilege evaluation as described in #3870.

This introduces de-normalized data structures that are optimized for the checks that need to be done during privilege evaluation. Additionally, certain objects (like DLS queries) are prepared ahead of time, as early as possible in order to minimize the overhead during actual privilege evaluation.

This is a big change set - in order to facilitate the review, I have split it into three major commits:

Optimized action privilege evaluation
Optimized DLS/FLS/FM privilege evaluation
Removal of unused code

The code is extensively commented - I hope that will help during review.

Category: Enhancement
Why these changes are required?

Performance tests indicate that the OpenSearch security layer adds a noticeable overhead to the indexing throughput of an OpenSearch cluster. The overhead may vary depending on the number of indices, the use of aliases, the number of roles and the size of the user object. The goal of these changes is to improve privilege evaluation performance and to make it less dependent on the number of indices, etc.

What is the old behavior before changes and new behavior after changes?

No significant behavioral changes in the "happy case", when privileges are present.

The undocumented config option config.dynamic.multi_rolespan_enabled is no longer evaluated. The code now behaves like it is always set to true - that is the former default. See #4495 for details.

Some slight changes are present in error cases:

More detailed error messages for missing privileges, showing a index/action matrix of missing privileges
Errors in the role configuration might be reported (as error log messages) more early, directly after the configuration was applied
The DLS/FLS implementation now defaults to a "deny by default" implementation. This is not relevant for normal cases. This will be only relevant if index requests pass through privileges evaluator even though there are no roles which grant privileges to the requested indices. Note: This would only happen in case of a bug in the code. In the previous versions, the DLS/FLS implementation would grant full access to the indices. Now, the DLS/FLS implementation acts as a second barrier, denying access to the indices.

Issues Resolved

Optimized Privilege Evaluation #3870

Testing

Each new component is accompanied by its own unit test.
The high-level functionality is validated by the existing integration tests
Mixed cluster behavior is verfied by SecurityBackwardsCompatibilityIT (extended in Fixed bulk index requests in BWC tests and hardened assertions #4817 )

Check List

New functionality includes testing
New functionality has been documented
Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

nibix · 2024-05-30T10:09:19Z

Please have a look at the approaches. I would be very interested in your opinions.

As mentioned above, the implementation is not complete yet. The code contains a couple of TODO comments to indicate what work needs to be done.

I would also like to discuss whether a couple of things would be really necessary or whether there might be a chance to simplify the implementation by abolishing them.

These are:

At the moment, only RoleV7 and ActionGroupV7 configurations are supported by the class ActionPrivileges. I am wondering wether there are any plans to get rid of the *V6 configurations at some point in time. The figure V6 still stems from ODFE support for Elasticsearch 6. Are there really still OpenSearch users on this configuration? Additionally, the use of two impl classes per config type makes many unsafe casts of the generic SecurityDynamicConfiguration class necessary. If there would be only one impl class per config type, the APIs could be designed much safer.
In config.yml, the semantics of roles.yml can be changed by setting multi_rolespan_enabled to false. The OpenSearch docs do not mention this flag. In my perception, there is no real use of this setting except maintaining backwards compatiblity. However, for OpenSearch, the default of was always true since its inception. Are there really users having it set to false?

nibix · 2024-05-30T10:11:31Z

I have also started to work on the micro benchmarks as discussed in #3903. The generally accepted standard for micro benchmarks in Java is the JMH framework. However, this is licensed as GPL v2 with classpath exception: https://github.com/openjdk/jmh/blob/master/LICENSE Is the inclusion of a dependency with such a license acceptable in OpenSearch?

src/main/java/org/opensearch/security/privileges/CheckTable.java

peternied · 2024-05-30T19:06:45Z

I have also started to work on the micro benchmarks as discussed in #3903. The generally accepted standard for micro benchmarks in Java is the JMH framework. However, this is licensed as GPL v2 with classpath exception: https://github.com/openjdk/jmh/blob/master/LICENSE Is the inclusion of a dependency with such a license acceptable in OpenSearch?

@cwperks Can you look into this?

peternied

Thanks for the great work here @nibix, looking forward to seeing the micro-benchmark numbers and seeing the test start passing again :D

I managed to do a high level pass of the changes, but notably did not look into depth on ActionPrivileges and FlattenedActionGroups.

src/test/java/org/opensearch/security/privileges/ActionPrivilegesTest.java

src/main/java/org/opensearch/security/privileges/UserAttributes.java

src/main/java/org/opensearch/security/privileges/ProtectedIndexAccessEvaluator.java

src/main/java/org/opensearch/security/privileges/PrivilegesEvaluatorResponse.java

cwperks · 2024-05-30T19:30:25Z

@cwperks Can you look into this?

We're looking into this and will get back with an answer.

DarshitChanpura · 2024-06-11T15:32:44Z

However, this is licensed as GPL v2 with classpath exception: https://github.com/openjdk/jmh/blob/master/LICENSE Is the inclusion of a dependency with such a license acceptable in OpenSearch?

The feedback we received was that the code can be used only for internal operations. Since JMH usage will be part of Open-source security, my understanding is that this is not approved.

codecov · 2024-06-28T15:08:04Z

Codecov Report

Attention: Patch coverage is 86.08949% with 286 lines in your changes missing coverage. Please review.

Project coverage is 72.46%. Comparing base (811f26d) to head (1fef11b).
Report is 2 commits behind head on main.

Files with missing lines	Patch %	Lines
...search/security/configuration/DlsFlsValveImpl.java	62.20%	41 Missing and 24 partials ⚠️
...security/configuration/DlsFlsFilterLeafReader.java	43.90%	42 Missing and 4 partials ⚠️
...earch/security/privileges/PrivilegesEvaluator.java	69.89%	17 Missing and 11 partials ⚠️
...earch/security/privileges/dlsfls/FieldMasking.java	86.50%	11 Missing and 11 partials ⚠️
...urity/privileges/dlsfls/FlsStoredFieldVisitor.java	51.35%	17 Missing and 1 partial ⚠️
...ensearch/security/privileges/ActionPrivileges.java	95.77%	9 Missing and 8 partials ⚠️
.../opensearch/security/OpenSearchSecurityPlugin.java	53.84%	10 Missing and 2 partials ⚠️
...ecurity/privileges/dlsfls/DlsFlsLegacyHeaders.java	82.85%	0 Missing and 12 partials ⚠️
...figuration/SecurityFlsDlsIndexSearcherWrapper.java	70.58%	6 Missing and 4 partials ⚠️
...privileges/dlsfls/AbstractRuleBasedPrivileges.java	97.63%	3 Missing and 4 partials ⚠️
... and 20 more

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4380      +/-   ##
==========================================
+ Coverage   70.90%   72.46%   +1.55%     
==========================================
  Files         310      324      +14     
  Lines       20950    21764     +814     
  Branches     3331     3453     +122     
==========================================
+ Hits        14855    15771     +916     
+ Misses       4341     4241     -100     
+ Partials     1754     1752       -2

Files with missing lines	Coverage Δ
...ty/configuration/ConfigurationLoaderSecurity7.java	`70.24% <100.00%> (+0.24%)`	⬆️
...ecurity/configuration/ConfigurationRepository.java	`76.51% <100.00%> (-2.66%)`	⬇️
...urity/configuration/PrivilegesInterceptorImpl.java	`59.77% <100.00%> (+2.45%)`	⬆️
...va/org/opensearch/security/configuration/Salt.java	`100.00% <ø> (ø)`
...rity/configuration/SystemIndexSearcherWrapper.java	`91.52% <100.00%> (+0.14%)`	⬆️
...nsearch/security/dlic/rest/api/RolesApiAction.java	`95.83% <100.00%> (-2.21%)`	⬇️
...org/opensearch/security/filter/SecurityFilter.java	`65.87% <100.00%> (ø)`
...rity/privileges/ExpressionEvaluationException.java	`100.00% <100.00%> (ø)`
...ch/security/privileges/PitPrivilegesEvaluator.java	`96.15% <100.00%> (-0.15%)`	⬇️
...es/PrivilegesConfigurationValidationException.java	`100.00% <100.00%> (ø)`
... and 40 more

... and 6 files with indirect coverage changes

nibix · 2024-07-04T10:07:19Z

@cwperks @peternied @DarshitChanpura @scrawfor99

Just FYI:

I worked a bit on the micro benchmarking part of this issue. As JMH was out due to its license, I reviewed other frameworks. It is remarkable that in most cases the descriptions of the frameworks will say "rather use JMH instead of this framework".

Anyway, I tried out https://github.com/noconnor/JUnitPerf because the idea of using JUnit infrastructure seemed to be nice. The big downside of JUnitPerf is that it does not work well together with parameterized JUnit tests.

See here for an example:

https://github.com/opensearch-project/security/blob/004df3bbdc69514f0c95acd2a1653a01e71758b9/src/performanceTest/java/org/opensearch/security/privileges/PrivilegesEvaluatorPerformanceTest.java

The high number of very similar methods is caused by the lack of parameter support - in the end we need to test quite a few different dimensions (like number of indices, number of roles, etc), on the same operation.

As I was really keen on getting some broader result, I went on the "roll your own" path and quick threw together some naive micro benchmarking code. So, this is just a temporary thing, thus very messy, but it gives me some numbers. See here:

https://github.com/opensearch-project/security/blob/004df3bbdc69514f0c95acd2a1653a01e71758b9/src/performanceTest/java/org/opensearch/security/privileges/PrivilegesEvaluatorPeformanceTest2.java

So, I let run some tests and here are some preliminary results.

Micro benchmark test results

Disclaimer

Generally, the real world meaningfulness of micro benchmarks is limited. On a full real cluster, this can look totally different due to:

Proportion of effects to other time consuming operations
Effects caused by garbage collection, thread synchronization or JIT
Different hardware which can sustain constant CPU load much better than the consumer system I used to run the benchmarks

On the other hand, micro benchmarks make some tests so much easier. For micro benchmarking, a Metadata instance with 100000 indices can be mocked within a few seconds. On the other hand, creating so many indices on a real cluster would take much, much longer.

Full cluster benchmarks are also coming up, but these are still in the works.

Scope

The micro benchmarks were applied to the following code:

security/src/performanceTest/java/org/opensearch/security/privileges/PrivilegesEvaluatorPeformanceTest2.java

Lines 501 to 512 in 004df3b

    
           try { 
        
               PrivilegesEvaluationContext context = subject.createContext( 
        
                       user(user), 
        
                       requestParameters.action, 
        
                       requestParameters.getRequest(this.random), 
        
                       null, 
        
                       null 
        
               ); 
        
               PrivilegesEvaluatorResponse response = subject.evaluate(context); 
        
               if (!response.isAllowed()) { 
        
                   throw new RuntimeException("Assertion failed: " + response);

For comparison, we also applied the micro benchmarks to the following code on the old code base:

https://github.com/nibix/security/blob/300d138578ef853071d649d647335d8430320f14/src/performanceTest/java/org/opensearch/security/privileges/PrivilegesEvaluatorPeformanceTest2.java#L502-L510

Due to refactorings, the code looks different. However, what happens under the hood is effectively the same.

Additionally some further code changes were necessary to make PrivilegeEvaluator independent from dependencies like ClusterService in order to make it really unit testable/benchmarkable. I first tried to use Mockito to mock ClusterService instances but had to learn that the performance characteristics of Mockito are so bad that it is unsuitable for micro benchmarking.

As we only look at the evaluate() method, DLS/FLS evaluation is disabled for this scope.

Tested dimensions

Action requests

We tested privilege evaluation with three different actions:

indices:data/write/bulk[s] with BulkShardRequest
- with 10 bulk items
- with 1000 bulk items
indices:data/write/bulk with BulkRequest
indices:data/read/search with SearchRequest
- with an index pattern that matches 2% of all indices (randomized)
- with an index pattern that matches 20% of all indices (randomized)

Number of indices on cluster

We tested with these indices:

10 indices:index_a0, index_a1, index_b0, index_b1, index_c0, index_c1, ... , index_e0, index_e1
30 indices: index_a0, ..., index_a5, ... , index_e0, ... index_e5
100 indices: index_a0, ..., index_a19, ... , index_e0, ... index_e19
300 indices
1000 indices
3000 indices
10000 indices
30000 indices
100000 indices

Different user configurations

A user with full privileges (using * for index_permissions and cluster_permssions)
A user with a single role giving CRUD permissions on index_a* and index_b*
A user with 20 roles giving CRUD permissions individually on index_a0, index_a1, ...
A user with 40 roles in total: 20 roles giving READ permissions individually on index_a0, index_a1, ... and 20 more roles giving WRITE permissions on the same indices
A user with a single role which uses a regex index pattern with a user attribute. This is interesting because it makes certain pre-computations impossible and requires to re-evaluate the index patterns for each request.

Results

The raw result data can be found here: https://docs.google.com/spreadsheets/d/1Hd6pZFICTeplXIun3UpEplANAwQDE0jXbqLnnJz61AI/edit?usp=sharing

In the shards below, dashed lines indicate the performance of the old privilege evaluation code on a particular combination of test dimensions. Solid lines with the same color indicate the performance of the new code with the same test dimensions. The x-axis represents the number of indices on the cluster, the y-axis represents the throughput in operations per second.

`bulk[s]`, `BulkShardRequest`

The performance of BulkShardRequests is the most interesting factor on clusters doing heavy ingestion. A single bulk requests will be broken down into the individual indices and shards, resulting in quite a few BulkShardRequests for which the privilege evaluation needs to be done in parallel, thus performance characteristics here have a high impact.

The privilege evaluation for the top level BulkRequest is less interesting because it is just an index-independent cluster privilege, which is easy to evaluate. Still, we will also review this below.

Requests with 10 items

Requests with 1000 items

Observation

The performance of the old code degrades with the increasing number of indices. Starting with 30000 indices, we have a method call latency which is > 10 ms. This is where users on ingestion heavy clusters often start to experience performance issues and the method calls start to show up in the hot thread dumps.

In contrast, the throughput of the new code stays constant, independent of the number of indices. It can be seen that the number of roles still has quite an effect on the throughput. But here we talk about time differences below 0.1 ms, which should not be significant in a real world cluster.

`bulk`, `BulkRequest`

The top level bulk action is a cluster action, so it does not require considering the indices on a cluster.

Observation

As expected, performance is independent of number of indices, both on the new implementation and on the old implementation. However, the new implementation improves throughput by a factor between 2 and 3.

`search`, `SearchRequest`

Search operations become interesting when there are monitoring/alerting solutions issuing search requests on broad index patterns in short time intervals.

Search with search patterns that match 2% of the indices

Search with search patterns that match 20% of the indices

Observation

Both the old and new code degrade with the growing number of indices. Profiling shows that this is mostly not due to privilege evaluation, but due to the index pattern expression resolution.

However, the new code retains method call latencies below 20 ms even on clusters with 100000 indices. The old code however, takes up to 5 seconds for a single method call on clusters with 100000 indices.

See the following chart for a zoomed in section of the 2% of indices case for 10000-100000 indices:

cwperks · 2024-07-17T18:58:20Z

@nibix thank you for the update. The results of bulk indexing in a cluster with a large number of indices is great to see and I like how the test cases isolate PrivilegeEvaluation for performance testing. Is there any work remaining before labeling this PR ready for review?

nibix · 2024-07-17T20:16:40Z

@cwperks

Is there any work remaining before labeling this PR ready for review?

Yes. Actually, I was also about to ping you regarding this.

The heap used by the de-normalized data structure needs to be capped (see Optimized Privilege Evaluation #3870 (comment) )
We need to come to a conclusion on the breaking changes (see [RFC] Retire support for V6 configuration #4493 and [RFC] Retire support for config option config.dynamic.multi_rolespan_enabled in config.yml #4495 ). At the moment, this only supports RoleV7. Also, multi_rolespan_enabled: false is unsupported. Of course it would be nice to have the benefits already available for OpenSearch 2. But that would mean either additional effort or the acceptance of some breaking changes. I have sketched a way to retire RoleV6 without breaking changes in Remove support for v6 configuration #4546 . However, I am at the moment not sure how a solution could look like that supports multi_rolespan_enabled: false in the new code.
We are also working on actual full cluster benchmarks, which give you concrete figures on the actual throughput improvements.

Note: This leaves the DLS/FLS implementation unchanged at the moment. Thus, we will still have a part of the problematic performance characteristics. However, I would still get this merged and then add DLS/FLS support in a further PR to keep things (relatively) small.

nibix · 2024-09-02T09:18:25Z

Benchmark results

After having shared the micro benchmarks before, I can now share results of benchmarks performed on an actual cluster. This brings the following benefits:

It puts the performance gains into perspective of the performance characteristics of the whole software. This way it gets apparent whether the performance gains actually have significance or if they just "vanish" behind other dominating performance characteristics.
We can also test the performance gains for DLS. With micro benchmarks, this is not possible as DLS is not implemented as an isolated piece of code, but is performed distributed over the cluster during a search operation.

Disclaimer

The usual benchmarking disclaimer: Of course, these figures can only represent very specific scenarios. Other scenarios can look quite different, as there are so many variables which can be different. Yet, especially the "slow parts" of the benchmark results can give one an impression where real performance issues are.

Test environment

We ran the tests on an OpenSearch cluster hosted using Kubernetes. The host machine for the K8S nodes was a physical server with an AMD EPYC 7401P CPU. We ran an OpenSearch cluster with 3 nodes, each node had 64 GB of RAM, 32 GB of Java Heap, and a cpu: 8 limit configured in K8S. The version of OpenSearch was a recent snapshot of main.

Tested dimensions

Operations

We benchmarked the following operations:

Ingestion with the REST _bulk API into a single index with 8 parallel clients
- with 10 bulk items per request
- with 1000 bulk items per request
Search requests with the REST _search API with 16 parallel clients
- on exactly one index
- on an index expression matching 2% of the indices on the cluster
- on an index expression matching 20% of the indices on the cluster

Number of indices on cluster

We tested with these indices:

10 indices: index_a0, index_a1, index_b0, index_b1, index_c0, index_c1, ... , index_e0, index_e1
30 indices: index_a0, ..., index_a5, ... , index_e0, ... index_e5
100 indices: index_a0, ..., index_a19, ... , index_e0, ... index_e19
300 indices
1000 indices
3000 indices
10000 indices

User configurations

We tested with differently configured users to find out about the effects of complexity of roles, DLS rules, etc. The users were:

A user identified by the super admin certificate. Using this certificate, most of the privilege evaluation code is by-passed. This gives us a rough "upper limit" for the possible performance.
A user with full privileges (using * for index_permissions and cluster_permssions)
A user with a single role giving CRUD permissions on index_a* and index_b*
A user with 20 roles giving CRUD permissions individually on index_a0, index_a1, ...
A user with 40 roles in total: 20 roles giving READ permissions individually on index_a0, index_a1, ... and 20 more roles giving WRITE permissions on the same indices
A user with a single role giving CRUD permissions on index_a* and index_b* and additionally a simple DLS query

Test results

A commented summary of the results follows in the upcoming sections.

The raw test results can be also reviewed at https://docs.google.com/spreadsheets/d/16VRr9B2bPTyyS_T-IZobUG3C0uJxnXRxlbmsdH_MeCg/edit?usp=sharing

Indexing throughput

Bulk size 10

In this and the following charts, the dashed lines represent OpenSearch with the standard security plugin. The solid lines represent OpenSearch with the security plugin with the optimized privilege evaluation code.

The blue line represents requests authenticated by the super admin certificate, which by-passes most of privilege evaluation. Thus, the blue line forms a kind of "hull curve", it can be seen as a rough theoretical maximum from a security plugin POV.

The green lines represent users with full privileges. Yellow lines represent users with limited privileges - the darker the yellow, the more complex the role configuration.

The benchmark results for the standard security plugin show a clear performance decline starting at about 300 indices on the cluster. This is caused by the privilege evaluation code resolving the index patterns in the roles configuration against all cluster indices for each request. At 1000 indices, we only get roughly 60% of the throughput that was observed for 10 indices.

Additionally, it can be seen that growing role complexity has a clear effect on throughput. More complex roles show a significant lower throughput.

On the other hand, the optimized security plugin shows a mostly linear throughput, which is independent of the number of indices. There is a slight decline starting from 3000 indices. The reasons for this are unknown so far. Yet, these values are still well above the standard plugin.

Bulk size 1000

Larger bulk sizes shift the place where the gap between standard and optimized plugin performance opens a bit to the right. Here we see a strong performance decline in the standard security plugin starting from 1000 indices.

The optimized security plugin exhibits better performance in all cases, though. Additionally, the performance decline that was visible for the bulk: 10 case is not really visible here any more.

The charts show a low throughput for the full privileges user both for the standard plugin and for the optimized plugin in the indices: 10 case. However, I would guess that this is some artifact of the used benchmarking process and is not a real performance characteristic.

Search throughput

Search on single index

The search throughput graphs introduce one further user: Users symbolized by the purple line are users which do have a document level access restriction implemented with a DLS query in one role.

Like for the bulk indexing, the standard plugin shows a declining performance with increasing number of indices on the clusters. Also here, the complexity of the roles configuration has a very significant effect on the throughput. Especially the user with DLS exhibits heavy performance degradations. With 300 indices, the DLS user shows less than half the throughput of the full privileges user of the standard plugin.

On the other hand, the optimized plugin shows mostly constant performance characteristics, independent of the number of indices. Also the DLS user does not show any significant degradation of performance.

Search on 2% of indices

When an index pattern comes into play, both the standard plugin and the optimized plugin show a performance degradation with a growing number of indices. This is due to the necessary resolution of the index pattern against all indices on the cluster.

The blue line - the super admin user - shows that there is quite a gap (about 20%, growing with higher number of indices) between the theoretically possible throughput and also the optimized plugin. This is likely due to the code in https://github.com/nibix/security/blob/main/src/main/java/org/opensearch/security/resolver/IndexResolverReplacer.java which still leaves some room for improvement.

Still, the optimized plugin delivers a clearly higher throughput in all cases.

Especially the DLS performance has strongly improved. For 1000 indices, we see a throughput of 1035 operations per second on the optimized plugin. The standard plugin just manages 53 operations per second with a service time of 300 ms per request. With 10000 indices, DLS gets virtually unusable on the standard plugin with just 0.6 operations per second and a service time of 16 seconds. The optimized plugin still delivers 99 operations per second.

Search on 20% of indices

With a search operation on 20% of indices, the index resolution performance gets so dominant that significant performance gains are no longer visible - except for DLS, which still shows very strong improvements. Using DLS with the standard plugin delivers a throughput of 7 operations per second already for just 1000 indices (service time 2.2 seconds). The optimized plugin still delivers a throughput of 113 operations per second (service time 176 ms).

The blue line - the admin cert user - shows that there is still room for improvement, though.

nibix · 2024-10-17T09:28:34Z

Side note: This is not the end of the story - there's still significant potential for performance improvements. I will file issues about that soon.

nibix · 2024-10-17T10:08:45Z

Note: The patch coverage is only 84.5% which is a bit low for my taste: #4380 (comment)

However, this low figure is mostly due to indentation changes in DLS/FLS classes due to newly necessary exception handlers. Thus, the low coverage of these classes leaks into this patch.

I have a couple of ideas on how to improve coverage for these classes, but I guess that this is outside of the scope of this PR.

cwperks · 2024-10-17T13:04:03Z

Side note: This is not the end of the story - there's still significant potential for performance improvements. I will file issues about that soon.

I look forward to seeing the new issues filed. Great detective work @nibix !

src/main/java/org/opensearch/security/privileges/dlsfls/AbstractRuleBasedPrivileges.java

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

…aluation Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

build.gradle

src/integrationTest/java/org/opensearch/security/privileges/dlsfls/FlsDocumentFilterTest.java

src/main/java/org/opensearch/security/OpenSearchSecurityPlugin.java

src/main/java/org/opensearch/security/privileges/dlsfls/DlsFlsProcessedConfig.java

cwperks · 2024-10-24T18:17:34Z

src/main/java/org/opensearch/security/privileges/PrivilegesEvaluatorResponse.java

    public Set<String> getMissingPrivileges() {
-        return new HashSet<String>(missingPrivileges);
+        return this.indexToActionCheckTable != null ? this.indexToActionCheckTable.getIncompleteColumns() : Collections.emptySet();


This will always be empty if the checkTable is null? In what scenarios can the checktable be null?

I think this is the last remaining case where checktable is null:

security/src/main/java/org/opensearch/security/privileges/PrivilegesEvaluator.java

Lines 328 to 343 in 1fef11b

PrivilegesEvaluatorResponse presponse = new PrivilegesEvaluatorResponse();

final String injectedRolesValidationString = threadContext.getTransient(

ConfigConstants.OPENDISTRO_SECURITY_INJECTED_ROLES_VALIDATION

);

if (injectedRolesValidationString != null) {

HashSet<String> injectedRolesValidationSet = new HashSet<>(Arrays.asList(injectedRolesValidationString.split(",")));

if (!mappedRoles.containsAll(injectedRolesValidationSet)) {

presponse.allowed = false;

presponse.missingSecurityRoles.addAll(injectedRolesValidationSet);

log.info("Roles {} are not mapped to the user {}", injectedRolesValidationSet, user);

return presponse;

}

mappedRoles = ImmutableSet.copyOf(injectedRolesValidationSet);

context.setMappedRoles(mappedRoles);

}

Indeed, the concept of missing privileges in not applicable there.

Of course, I can also try to refactor this, but I am not sure if that's really in the scope of the PR.

Should we be addressing this in a follow-up? Seems like a good piece to refactor

Yes, cleaning up PrivilegeEvaluatorResponse is a good idea!

cwperks · 2024-10-24T18:27:44Z

src/main/java/org/opensearch/security/privileges/ActionPrivileges.java

+
+                                    if (roleSetBuilder.getEstimatedByteSize() + indexMapBuilder
+                                        .getEstimatedByteSize() > statefulIndexMaxHeapSize.getBytes()) {
+                                        log.info(


codecov is showing that these lines are not hit. Are there any tests that cover this scenario?

cwperks · 2024-10-25T12:29:15Z

src/main/java/org/opensearch/security/privileges/PrivilegesEvaluatorResponse.java

+        return response;
+    }
+
+    public static PrivilegesEvaluatorResponse insufficient(


If the second arg is not needed here can it be removed?

Done in c474bf7

cwperks · 2024-10-25T13:05:25Z

src/main/java/org/opensearch/security/privileges/PrivilegesEvaluator.java

@@ -262,7 +325,7 @@ public PrivilegesEvaluatorResponse evaluate(PrivilegesEvaluationContext context)
            action0 = PutMappingAction.NAME;
        }

-        final PrivilegesEvaluatorResponse presponse = new PrivilegesEvaluatorResponse();
+        PrivilegesEvaluatorResponse presponse = new PrivilegesEvaluatorResponse();


It looks like this presponse object is primarily used for exiting early from privileges evaluation, like in the case of snapshot, system index, protected index and PIT privilege. Since its using one of the constructors directly, the error message for privilege evaluation will not include the missing privileges. Do you think it makes sense to return the missing privileges for such cases?

i.e.

if (snapshotRestoreEvaluator.evaluate(request, task, action0, clusterInfoHolder, presponse).isComplete()) { if (!presponse.isAllowed()) { return PrivilegesEvaluatorResponse.insufficient(action0, context); } else { return presponse; } }

Looking at SnapshotRestoreEvaluator, there are actually quite a few different error scenarios:

The operation is not allowed at all for normal users

The operation includes protected indices

The actual presence of the permission is actually not checked in the SnapshotRestoreEvaluator, this is checked below in the "normal" privilege evaluation support.

Thus, it would be misleading to report the restore action as missing privilege, as adding that privilege won't fix the error.

However, one could extend SnapshotRestoreEvaluator to include the reason of the error in the reason attribute of presponse. That is however IMHO outside of the scope of this PR.

src/main/java/org/opensearch/security/privileges/ActionPrivileges.java

DarshitChanpura

Thank you @nibix for improving this feature. I took initial pass and have left comments, most of which are clarification questions.

src/main/java/org/opensearch/security/OpenSearchSecurityPlugin.java

src/integrationTest/java/org/opensearch/security/privileges/ActionPrivilegesTest.java

src/main/java/org/opensearch/security/securityconf/impl/SecurityDynamicConfiguration.java

DarshitChanpura · 2024-10-29T03:32:41Z

src/main/java/org/opensearch/security/privileges/IndexPattern.java

+    private IndexPattern(WildcardMatcher staticPattern, ImmutableList<String> patternTemplates, ImmutableList<String> dateMathExpressions) {
+        this.staticPattern = staticPattern;
+        this.patternTemplates = patternTemplates;
+        this.dateMathExpressions = dateMathExpressions;


why require a separate variable for these? Is it for daily-rotating index or something similar?

It is a feature of the index_pattern attribute in the roles configuration to support also date math expressions - it has been there from the beginning.

I guess you could use it for defining that users are just allowed to see the data from today's index, but not from indices in the past. Yet, I find the use of date math for this purpose questionable, as it lacks expressiveness.

So, this is just for backwards compat.

See also the comment here:

security/src/main/java/org/opensearch/security/privileges/IndexPattern.java

Lines 87 to 88 in 1fef11b

// Note: The use of date math expressions in privileges is a bit odd, as it only provides a very limited

// solution for the potential user case. A different approach might be nice.

TIL. Hmm, I never have used it so didn't understand the comment fully.

src/main/java/org/opensearch/security/privileges/PrivilegesEvaluator.java

DarshitChanpura · 2024-10-29T03:49:43Z

src/main/java/org/opensearch/security/privileges/PrivilegesEvaluatorResponse.java

    public Set<String> getMissingPrivileges() {
-        return new HashSet<String>(missingPrivileges);
+        return this.indexToActionCheckTable != null ? this.indexToActionCheckTable.getIncompleteColumns() : Collections.emptySet();


Should we be addressing this in a follow-up? Seems like a good piece to refactor

src/main/java/org/opensearch/security/privileges/WellKnownActions.java

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

nibix mentioned this pull request May 30, 2024

Optimized Privilege Evaluation #3870

Open

nibix commented May 30, 2024

View reviewed changes

src/main/java/org/opensearch/security/privileges/CheckTable.java Outdated Show resolved Hide resolved

peternied reviewed May 30, 2024

View reviewed changes

This was referenced Jun 8, 2024

Extracted the user attr handling methods from ConfigModelV7 into its own class #4416

Merged

Replaced uses of SecurityRoles by Set<String> mappedRoles where the SecurityRoles functionality is not needed #4432

Merged

nibix mentioned this pull request Jun 12, 2024

New algorithm for resolving action groups #4448

Merged

3 tasks

nibix force-pushed the optimized-privilege-evaluation branch 2 times, most recently from d128425 to 1714bd7 Compare June 20, 2024 15:27

nibix mentioned this pull request Jun 24, 2024

do_not_fail_on_forbidden mode introduces inconsistencies for mget, msearch and similar actions #4485

Closed

nibix force-pushed the optimized-privilege-evaluation branch from 1714bd7 to 764b826 Compare June 25, 2024 07:29

nibix mentioned this pull request Jun 26, 2024

Separated DLS/FLS privilege evaluation from action privilege evaluation #4490

Merged

3 tasks

nibix force-pushed the optimized-privilege-evaluation branch from b5ff5c8 to 004df3b Compare July 4, 2024 09:05

nibix force-pushed the optimized-privilege-evaluation branch 2 times, most recently from 30fcc0f to 41dd986 Compare July 16, 2024 09:30

nibix force-pushed the optimized-privilege-evaluation branch from 7adb281 to 504a157 Compare July 17, 2024 18:14

nibix mentioned this pull request Jul 19, 2024

Fixed READ_ACTIONS required by TermsAggregationEvaluator #4582

Merged

1 task

nibix force-pushed the optimized-privilege-evaluation branch 2 times, most recently from b30b530 to 78db68b Compare August 2, 2024 10:17

nibix force-pushed the optimized-privilege-evaluation branch 2 times, most recently from fca7058 to fcc6183 Compare September 2, 2024 07:45

nibix force-pushed the optimized-privilege-evaluation branch from 3df0a95 to 1964b4d Compare October 17, 2024 09:19

cwperks reviewed Oct 17, 2024

View reviewed changes

src/main/java/org/opensearch/security/privileges/dlsfls/AbstractRuleBasedPrivileges.java Show resolved Hide resolved

nibix added 10 commits October 21, 2024 20:19

Optimized privilege evaluation

ed13b9b

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Optimized DLS/FLS

b64f535

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Removed code which is no longer needed

4cb4a5f

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Made addStatics() generic

d752c93

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Review: Properly abort in case a restricted role fails during rule ev…

2327925

…aluation Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Tests and cleanup

a6ea16c

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Tests and cleanup

7aed7c5

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Applied changes from opensearch-project#4826

4d07013

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Applied spotless

6212572

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Applied spotless

1fef11b

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

nibix force-pushed the optimized-privilege-evaluation branch from f59eacf to 1fef11b Compare October 21, 2024 18:34

cwperks reviewed Oct 22, 2024

View reviewed changes

build.gradle Show resolved Hide resolved

cwperks reviewed Oct 22, 2024

View reviewed changes

src/integrationTest/java/org/opensearch/security/privileges/dlsfls/FlsDocumentFilterTest.java Show resolved Hide resolved

cwperks reviewed Oct 22, 2024

View reviewed changes

src/main/java/org/opensearch/security/OpenSearchSecurityPlugin.java Show resolved Hide resolved

cwperks reviewed Oct 22, 2024

View reviewed changes

src/main/java/org/opensearch/security/privileges/dlsfls/DlsFlsProcessedConfig.java Show resolved Hide resolved

cwperks reviewed Oct 24, 2024

View reviewed changes

cwperks reviewed Oct 25, 2024

View reviewed changes

src/main/java/org/opensearch/security/privileges/ActionPrivileges.java Show resolved Hide resolved

DarshitChanpura reviewed Oct 29, 2024

View reviewed changes

nibix added 4 commits October 30, 2024 07:04

Removed unnecessary parameter

c474bf7

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Added comment

de14c67

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Added comment

ca66928

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Code restructuring

45f40b8

Signed-off-by: Nils Bandener <nils.bandener@eliatra.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimized Privilege Evaluation #4380

Optimized Privilege Evaluation #4380

nibix commented May 30, 2024 •

edited

Loading

nibix commented May 30, 2024

nibix commented May 30, 2024

peternied commented May 30, 2024

peternied left a comment

cwperks commented May 30, 2024

DarshitChanpura commented Jun 11, 2024

codecov bot commented Jun 28, 2024 •

edited

Loading

nibix commented Jul 4, 2024 •

edited

Loading

cwperks commented Jul 17, 2024

nibix commented Jul 17, 2024

nibix commented Sep 2, 2024 •

edited

Loading

nibix commented Oct 17, 2024

nibix commented Oct 17, 2024

cwperks commented Oct 17, 2024

cwperks Oct 24, 2024

nibix Oct 25, 2024

DarshitChanpura Oct 29, 2024

nibix Oct 29, 2024

cwperks Oct 24, 2024

cwperks Oct 25, 2024

nibix Oct 30, 2024

cwperks Oct 25, 2024

nibix Oct 25, 2024

DarshitChanpura left a comment

DarshitChanpura Oct 29, 2024

nibix Oct 29, 2024

DarshitChanpura Oct 29, 2024

DarshitChanpura Oct 29, 2024

	PrivilegesEvaluatorResponse presponse = new PrivilegesEvaluatorResponse();

	final String injectedRolesValidationString = threadContext.getTransient(
	ConfigConstants.OPENDISTRO_SECURITY_INJECTED_ROLES_VALIDATION
	);
	if (injectedRolesValidationString != null) {
	HashSet<String> injectedRolesValidationSet = new HashSet<>(Arrays.asList(injectedRolesValidationString.split(",")));
	if (!mappedRoles.containsAll(injectedRolesValidationSet)) {
	presponse.allowed = false;
	presponse.missingSecurityRoles.addAll(injectedRolesValidationSet);
	log.info("Roles {} are not mapped to the user {}", injectedRolesValidationSet, user);
	return presponse;
	}
	mappedRoles = ImmutableSet.copyOf(injectedRolesValidationSet);
	context.setMappedRoles(mappedRoles);
	}

	// Note: The use of date math expressions in privileges is a bit odd, as it only provides a very limited
	// solution for the potential user case. A different approach might be nice.

Optimized Privilege Evaluation #4380

Are you sure you want to change the base?

Optimized Privilege Evaluation #4380

Conversation

nibix commented May 30, 2024 • edited Loading

Description

Issues Resolved

Testing

Check List

nibix commented May 30, 2024

nibix commented May 30, 2024

peternied commented May 30, 2024

peternied left a comment

Choose a reason for hiding this comment

cwperks commented May 30, 2024

DarshitChanpura commented Jun 11, 2024

codecov bot commented Jun 28, 2024 • edited Loading

Codecov Report

nibix commented Jul 4, 2024 • edited Loading

Micro benchmark test results

Disclaimer

Scope

Tested dimensions

Action requests

Number of indices on cluster

Different user configurations

Results

bulk[s], BulkShardRequest

Requests with 10 items

Requests with 1000 items

Observation

bulk, BulkRequest

Observation

search, SearchRequest

Search with search patterns that match 2% of the indices

Search with search patterns that match 20% of the indices

Observation

cwperks commented Jul 17, 2024

nibix commented Jul 17, 2024

nibix commented Sep 2, 2024 • edited Loading

Benchmark results

Disclaimer

Test environment

Tested dimensions

Operations

Number of indices on cluster

User configurations

Test results

Indexing throughput

Bulk size 10

Bulk size 1000

Search throughput

Search on single index

Search on 2% of indices

Search on 20% of indices

nibix commented Oct 17, 2024

nibix commented Oct 17, 2024

cwperks commented Oct 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DarshitChanpura left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nibix commented May 30, 2024 •

edited

Loading

codecov bot commented Jun 28, 2024 •

edited

Loading

nibix commented Jul 4, 2024 •

edited

Loading

`bulk[s]`, `BulkShardRequest`

`bulk`, `BulkRequest`

`search`, `SearchRequest`

nibix commented Sep 2, 2024 •

edited

Loading